Fair Boosting: a Case Study
نویسنده
چکیده
We study the classical AdaBoost algorithm in the context of fairness. We use the Census Income Dataset (Lichman, 2013) as a case study. We empirically evaluate the bias and error of four variants of AdaBoost relative to an unmodified AdaBoost baseline, and study the trade-offs between reducing bias and maintaining low error. We further define a new notion of fairness and measure it for all of our methods. Our proposed method, modifying the hypothesis output by AdaBoost by shifting the decision boundary for the protected group, outperforms the state of the art for the census dataset. Although there are several papers on “fair” versions of learning algorithms such as naive Bayes, decision tree learning or logistic regression, boosting, which is one of the most successful and most widely used machine learning algorithms, has not been studied in the context of fair learning before. In addition to its popularity, boosting is an interesting framework in which to study fairness because notions such as a weak learner and the boosting margin have natural interpretations for fairness. We rigorously define these notions in Section 2 and analyze them in Section 3. Following previous literature, we assume that the training data is biased against data points with a given feature value but we do not have access to the unbiased ground truth. We want to learn a classifier which has minimal error (as evaluated on the biased data) among all classifiers that achieve statistical parity. Dwork et al. (2012) point out that bias represents a notion of group fairness rather than individual fairness, and that it is still possible to discriminate against individuals even when achieving statistical parity. Thus, in addition to learning a classifier that has both low bias and error, we want a classifier that performs well on a measure of individual fairness. In this paper, we introduce a notion of fairness that captures how resistant a classifier is to bias generated independently at random against data points wtih a given feature value. The Census Income Data Set (Lichman, 2013) is a widely used data set for machine learning research in which the learner’s goal is to predict whether an individual’s income exceeds $50k per year based on census data such as age, education, gender, and marital status. In particular, when considering gender as a protected attribute, the dataset exhibits high bias. We use this data set as a case study to understand the fairness properties of the AdaBoost algorithm of Freund & Schapire (1997). We provide information about the Census data set in Section 1. A primary advantage to using boosting is that boosting has a natural notion of confidence which we can take advantage of to try to decrease bias while keeping error low. Our main empirical finding is that after boosting is performed to produce a hypothesis h, flipping the output label of h according to the boosting signed confidence of the protected group outperforms the state of the art on the Census dataset both in terms of bias and label error. We compare this to data massaging (introduced by Kamiran & Calders (2009)), replacing a standard weak learner with a “fair” weak learner, and i.i.d. random relabeling. Finally, in Section 4 we interpret and discuss our results.
منابع مشابه
Improving reservoir rock classification in heterogeneous carbonates using boosting and bagging strategies: A case study of early Triassic carbonates of coastal Fars, south Iran
An accurate reservoir characterization is a crucial task for the development of quantitative geological models and reservoir simulation. In the present research work, a novel view is presented on the reservoir characterization using the advantages of thin section image analysis and intelligent classification algorithms. The proposed methodology comprises three main steps. First, four classes of...
متن کاملThe Study of Effective Factors on the Creation and Boosting Organizational Morale (Case Study: Students of a Military University)
Introduction: Studying on employees' morale is one of the effective ways to assess the success or failure of organizations for achieving their goals. Objective the present study aimed to investigate the impact of managers’ organizational trust and managerial competency on the students' organizational morale. Material and Method: This research is a descriptive-analytical and survey-based study....
متن کاملArchitectural Support for Compile-time Speculation
Studies on instruction-level parallelism (ILP) have shown that there are few independent instructions within the basic blocks of non-numerical applications. To uncover more independent instructions within these applications, instruction schedulers and microarchitectures must support the speculative execution of instructions. This paper describes an architectural mechanism for speculative execut...
متن کاملSecretly Embedding Trapdoors into Contract Signing Protocols
Contract signing protocols have been proposed and analyzed for more than three decades now. One of the main problems that appeared while studying such schemes is the impossibility of achieving both fairness and guaranteed output delivery. As workarounds, cryptographers have put forth three main categories of contract signing schemes: gradual release, optimistic and concurrent or legally fair sc...
متن کاملOutlier Detection by Boosting Regression Trees
A procedure for detecting outliers in regression problems is proposed. It is based on information provided by boosting regression trees. The key idea is to select the most frequently resampled observation along the boosting iterations and reiterate after removing it. The selection criterion is based on Tchebychev’s inequality applied to the maximum over the boosting iterations of ...
متن کامل